Lazy Query Expansion
نویسنده
چکیده
An information retrieval or document base system has to somehow deal with various phenomena of equivalence of some strings. These are lowercase versus uppercase matching, morphological inflection, derivation, and synonymy of words: e.g., given a query computer, find Computers, computing, workstation. The latter problems are very important in languages with richer morphology and less stable terminology than in English. Also, much better recall is achieved by matching hyponyms and hypernyms using a thesaurus, e.g., given a query computers, find also supercomputer, microcomputer, mainframe, machine, device, processor, UNIX, etc. Technically, this can be handled at the time of indexing by reducing related strings to a common form, or at the time of query processing by expanding the query with the whole set of the related forms. We argue for that the latter way allows for greater flexibility and easier maintenance, while being more affordable than it is usually considered. We propose to expand the query with only those words that really appear in the document base. Our experiments with a thesaurus-based information retrieval system we are developing for the Senate of Mexican Republic show only insignificant increase of the real user queries on average with the 200-megabyte document base of the Senate, in spite of highly inflective Spanish language.
منابع مشابه
Query expansion based on relevance feedback and latent semantic analysis
Web search engines are one of the most popular tools on the Internet which are widely-used by expert and novice users. Constructing an adequate query which represents the best specification of users’ information need to the search engine is an important concern of web users. Query expansion is a way to reduce this concern and increase user satisfaction. In this paper, a new method of query expa...
متن کاملQEA: A New Systematic and Comprehensive Classification of Query Expansion Approaches
A major problem in information retrieval is the difficulty to define the information needs of user and on the other hand, when user offers your query there is a vast amount of information to retrieval. Different methods , therefore, have been suggested for query expansion which concerned with reconfiguring of query by increasing efficiency and improving the criterion accuracy in the information...
متن کاملQuery Architecture Expansion in Web Using Fuzzy Multi Domain Ontology
Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...
متن کاملLazy Learning for Classification Based on Query Projections
We propose a novel lazy learning method called QPAL. QPAL does not simply utilize a kind of distance measure between the query instance and training instances as many lazy learning methods do. It attempts to discover useful patterns known as query projections, which are customized to the query instance. The discovery for useful QPs is conducted in an innovative way. QPAL can guarantee to discov...
متن کاملPackage ‘ lazy ’
Description By combining constant, linear, and quadratic local models, lazy estimates the value of an unknown multivariate function on the basis of a set of possibly noisy samples of the function itself. This implementation of lazy learning automatically adjusts the bandwidth on a query-by-query basis through a leave-one-out cross-validation.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computación y Sistemas
دوره 6 شماره
صفحات -
تاریخ انتشار 2002